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ABSTRACT 

Tliis  paper  is  a  shortened  version  of  [1],  its 
basic  purpose  being  to  provide  an  easily  accessible  in¬ 
troduction  to  the  results  of  [1],  many  of  which  are 
presented  here  without  proofs.  However,  we  have  tried 
to  rearrange  the  material  of  [1],  changing  the  logical 
order  in  which  various  topics  are  introduced,  and  occa¬ 
sionally  we  regard  the  results  from  a  somewhat  different 
angle.  This  has  been  done  to  increase  the  present 
paper's  usefulness  as  a  complement  to  [1]. 

The  work  rcixirted  here  is  aimed  at  providing  a 
theory  of  smoothing  in  the  context  of  stochastic  reali¬ 
zation  theory.'  This  approach  enables  us  to  obtain  sto¬ 
chastic  interpretations  of  many  important  smoothing 
formulas  and  to  explain  the  relationship  between  them. 

In  this  paper,  however,  we  shall  only  consider  one  such 
formula,  namely  the  Mayne-Fraser  two-filter  formula, 
which  has  a  very  natural  interpretation  in  the  stochas¬ 
tic  realization  setting;  we  refer  the  reader  to  [1]  for 
further  results.  As  a  by-product,  we  also  obtain  cer¬ 
tain  results  on  the  stochastic  realization  problem 
Itself. 

1.  INTRODUCTION 

Consider  a  linear  stochastic  system 


dx  .  A(t)x(t)dt  ♦  B(t)dw  ;  x(0)  »  C  (1-la) 
dy  -  C(t)x(t)dt  ♦  0(t)dw  ;  y(0)  ■  0  (1.1b) 


defined  on  the  interval  0  s  t  s  T,  where  x  is  the  n-dira- 
cnsional  state  proceea,  y  is  the  m-dimensional  output 
proeeta,  w  is  a  p-dimensional  process  with  orthogonal 
increments  such  that 

E(dw}  •  0;  Eidwdw")  •  Idt  (1.2) 

(prime  denotes  transposition) ,  E  is  a  centered  random 
vector  with  finite  covariance  II  rwElEC")  snd  uneorrolated 
with  w,  and  A,  B,  C,  and  D  are  matrices  of  bounded  func¬ 
tions  with  properties  to  be  further  specified  below.  We 
shall  consider  two  problems  related  to  such  systems: 

Problem  1 .  For  an  arbitrary  t  c  [0,T],  find  the  linear 
least-squares  estiMte  x(t)  of  the  state  x(t)  given  the 
output  record  {y(t)  ;  0  s  t  s  T),  i  .e. ,  find  the  vide 
sense  eonditicnal  msun 

«(t)  •  E{x(t)iy(T)  ;  0  s  T  s  T).  (1.3) 

This  is  the  emoethina  problen,  which  h.is  genersted  a 
rather  extensive  literature  [1-18,  48],  and  is  of  con¬ 
siderable  importance  in  applications. 


*  This  research  was  supported  partially  by  the  Airforce 
Office  of  Scientific  Research,  USAF  Systems  Command, 
under  grant  No.  APOSR-78-3519,  and  partially  by  the  Na¬ 
tional  Science  Foundation  under  grant  No.  ENC-79-03731 . 


Problem  2.  Given  the  stochastic  process  {y(t)  ; 

0  s  t  S'T),  find  all  possible  systems  (1.1)  (in  some 
suitable  class  of  models  S)  having  this  process  as  its 
output  process.  This  is  the  atochastie  realitation  pro¬ 
blem  discussed  in  [20-33],  and  each  such  model  S  is 
called  a  stochastic  realization  of  {y(t)  ;  Os  t  s  T). 
Note  that  we  are  only  considering  proper  stochastic  re¬ 
alizations  [20],  i.e.,  models  S  whose  outputs  not  nicrcl> 
have  the  same  covariance  properties  as  the  given  process 
(the  only  requirement  in  the  e.^rlier  re.al izntion  theory 
[34-38]),  but  are  equal  to  it  s.s.  for  each  t. 

As  we  shall  see  in  this  p;ipcr,  these  two  problcras 
are  intimately  connected  to  each  other.  In  f.-ict,  .til 
the  well-known  smoothing  foriailas  found  in  [2-i8j  h.tve  ' 
natural  interpretations  in  the  stochastic  realization 
setting:  see  [1]  for  a  more  complete  discussion  of  these 
results.  Here  we  shall  only  consider  the  so-called 
Mayne-Fraaer  tuo-filter  formula  [S,6],  on  which  topic  a 
large  number  of  papers  have  been  written  [7-9,12-lT]. 

The  many  atten^ts  to  motivate  this  formula  stochastically 
have,  in  our  opinion,  been  less  than  convincing,  tve 
refer  the  reader  to  [48]  for  a  well -written  account  of 
these  matters.  In  our  realization  setting,  however, 
the  two  filters  have  a  natural  interpretation;  they  are 
simply  the  minimum-  and  maximum-variance  realizations 
respectively.  Hence,  the  latter  is  not  a  "backward 
filter"  as  suggested  in  the  literature  (although  it  can 
be  reformulated  as  such),  but  a  "forward  filter"  just  as 
its  structure  suggests. 

Hie  concept  of  baoheard  I'eaiiaaticn  is  an  essen¬ 
tial  tool  in  this  paper.  A  similar  approach  was  applied 
to  the  smoothing  problem  in  the  earlier  papers  I14-;T], 
but,  since  only  "wide  sense"  b.ackward  representations 
were  used,  some  subtle  points  were  overlooked.  The  fun- 
damontal  idea  of  this  paper,  to  embed  the  given  system 
(1.1)  into  a  class  of  stochastic  realizations,  was  kiq. 
tivated  by  the  results  in  [20-22].  Note  that  restrict¬ 
ing  our  analysis  to  models  (1.1)  for  which  BD'  •  0  (as 
in  [14-17]),  would  render  the  natural  class  of  realiza¬ 
tions  incomplete,  since  it  would  exclude  the  minimum- 
and  maximum- variance  realizations. 

This  paper  is  essentially  a  shortened  conference 
version  of  [1],  but  the  last  section  contains  some  as¬ 
pects  on  the  stochastic  realization  problem  not  included 
in  [1].  Mhenever  a  proof  hat  been  omitted,  it  can  be 
found  in  [1]. 

2.  lOFi  NOTATIONS 

Lot  H  be  the  space  of  all  centered  stoch.astic  v.iri- 
oblcs  (on  an  underlying  probability  space)  with  finite 
second-order  moments.  Then  11  is  a  llilbort  sp.ace  witli 
inner  product  (E,n)  ■  r.lCn).  For  an  arbitrary  k-diiucn-  ( 
sional  stochastic  process  {z(t);  0  s  t  s  T)  with  comiwi- 
nents  in  H,  define  ll^iz)  to  be  the  (closed)  subspace 

spanned  by  the  random  variables  {zj(t),  t,(t) .  , 

z^(t);,  and  let  ll(z)  be  the  closed  linear  hull  in  It  of 
the  subspaccs  {H^(z);  0  s  t  s  T);  we  shall  write  this  as 
H(»)  H^(z).  Similarly  define  the  past  space 
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nils)  t]  "t^*)  =* 

V  ,r  Ti  n  (z)-  Sometimes  we  sliall  be  more  interested 

TS^C^TJ  T 

in  spaces  spanned  by  the  itierements  of  z.  Hence,  we 
define  H(dz),  lii(dz3  Hi(dz}  to  be  the  closed  linear 

hulls  in  H  of  {z(t)  -  z(o) ;  t,  o  <  1}  where  I  is  the 
interval  [O.T],  [O.t]  and  [t.T]  respectively. 

For  each  n  <  n  and  subspace  K  c  H  let  d{>i|K}  be 
the  projection  of  n  onto  K,  i.e.,  the  uide  a«nae  oon~ 
diticnal  mean.  Let  u  be  a  stochastic  vector  with  com¬ 
ponents  in  II,  and  let  il(u]  be  the  closed  linear  span  in 
H  of  the  componentj;  of  u.  Then,  for  any  n  e  H,  we 
shall  often  write  li{  nl  u^)  in  place  of  'E{n  I  H(u)},  and, 
for  any  subspace^K  c  H,  E{  ui  K)  will  denote  the  vector 
with  components  E{Uj^|K}. 

3.  FORWARD  AND  BACKWARD  STOCHASTIC  REALIZATIONS 

Assuming  th.-it  R  :»  DD'  has  a  bounded  inverse  on 
[O.T],  it  is  well-known  that  the  linear  least-squares 
estimate 

x.(t)  -  E{x(t)|Hi(dy)}  -  (3.1) 

of  the  state  process  x  of  S  is  generated  on  [0,T]  by 
the  Kalman-Buey  filter 

dx,  *  Ax.dt  +  B*R*^^^(dy  -  Cx.dt) ; 


where  R*'^(t)  is  the  symmetric  square  root  of  R(t),  and 
the  gain  function  B,  is  given  by 

A  _  /rt  ..  (3,2b) 


B.  -  (Q.C'  ♦  B0')R 
the  error  covariance  matrix 


q.(t)  A  E{[x(t)  -  x.(t)][x(t)  -  x.(t)]'}  (3.2c) 

being  the  solution  of  the  matrix  Riccati  equation 
[Q.  -  AQ.  ♦  Q,A' 

I  -  (Q.C'  ♦  BO')R'\q.C'  ♦  BD')'  ♦  BB'  (3. 2d) 
[Q.(0)  -  n. 

Note  that  the  filter  (3.2a),  and  hence  the  estimate  x,, 
is  completely  determined  by  the  matrices  A,  C,  R  and 
B..  Clearly  there  are  many  models  S  having  the  same 
Kalman-Bucy  filter. 

In  the  sequel  we  shall  only  consider  models  S 
which  are  minimal,  i.e.,  there  is  no  other  realization 
of  (y(t);  0  S  t  s  T)  with  a  state  process  x  of  smaller 
dimension  n,  and  analytic,  i.e.,  the  coefficient  ma¬ 
trices  A,  D,  C,  n  ,ind  R*1  are  analytic  on  [0,T].  Both 
these  assumptions  are  purely  technical  and  iire  intro¬ 
duced  to  insure  that  a  certain  matrix  function  is  in¬ 
vertible;  they  could  probably  be  removed  at  the  price 
of  a  less  elegant  theory.  Now,  let  the  initial  reali¬ 
zation  S  used  in  fbrming  (3.2)  be  minimal  and  analytic, 
and  define  S  to  be  the  class  of  all  analytic  realiza¬ 
tions  of  {y(t);  0  s  t  s  T)  having  (3.2)  as  its  Kalman- 
Buey  filter.  Then  all  realizations  of  class  S  are 
minimal.  Clearly  A,  C  and  R  :«  DD'  are  the  same  for 
all  S  <  S,  while  B,  D  and  the  state  eevariance  function 

P(t)  :•  E(x(t)x(t)'}  (3.3) 

will  differ  ever  the  cl.sss  S.  Of  course,  different 
S  t  S  will  have  completely  different  stochastic  pro¬ 
cesses  X  and  w. 


Furthermore,  from  (3.2b)  and  the  fact  that  Q,  «  P  -  r., 
where  P,(t)  :«  E{x,(t)x,(t)') ,  it  follows  that  also  the 
function 

G  :«  PC:  *  BD'  (3.4) 

is  an  invariant  over.S.  In  fact, 

G  •  P.C'  ♦  B.R^^^.  (3.5) 

It  is  easy  to  see  that  P  satisfies  the  matrix 
differential  equation 

P  A  AP  ♦  PA'  ♦  BB'  ;  P(0)  -  n.  (3.6) 

which  has  the  solution 

P(t)  A  ♦(t,o)n6(t,o)' 


f  ♦(t. 


T)B(T)B(T)'*(t.-)'dT 


where  4'  is  the  transion  matrix  of  the  system  :  •  A:  of 
differential  equations.  Hence,  P(t)  >  0  for  all  t  t  [0,Tj 
if  and  only  if  S  belongs  to  the  subclass  5.  >  (SeSuT  >  0;. 
[For  symmetric  matrices  P  and  Q,  P  a  Q  (P  >  Q)  means  that 
P-Q  is  nonnegative  (positive)definite. ]  It  can  be  shown 
[1]  that  is  nonempty. 

Let  S  f  S^.  Then  x(t)  :•  P(t)*^x(t)  is  a  well- 
defined  stochastic  vector  process  on  all  of  [O.T],  and 
it  can  be  shown  [1;  Lemma  2.3]  that  it  satisfies  the 
baekjord  Markovian  representation 


dx  A  -A'xdt  ♦  Bdw  ;  x(7) 


(3.8a) 


where  B  «  P  B,  w  is  a  p-dimensional  orthogonal  increment 
process  of  t>’pe  (1.2)  defined  by 

dw  >  dw  -  B'P'^xdt,  (3.5b) 

and  X  :»  P(T)"^x(T)  is  imcorrelated  with  w.  Then 
H'(dw)  i  ll*(x)  for  all  t  e  [O.T];  this  is  what  char.icter- 
iied  the  backuard  property  of  (3.S).  Moreover,  the  state 
covariance  function  P(t)  :•  E(x(t)iS(tj')  satisfies  P  « 
p-1. 

Representation  (3.8)  is  a  strict  sense  version  of 
a  similar  result  presented  in  [15,16,42].  (The  last 
paper  contains  an  alternative  justification  of  the  formu¬ 
las  of  [15,16]  using  the  techniques  of  [12,13].)  Tiie 
version  given  in  these  ]>apers  is  however  insufficient  for 
our  purposes  since  it  provides  a  representation  up  to 
second-order  properties  only.  Modulo  some  trivial  tech¬ 
nicalities,  the  proof  of  (3.8)  above  [1;  Lemma  2.3]  is 
the  same  as  the  one  presented  in  [20].  (In  this  context 
it  should  be  mentioned  that  all  the  basic  ideas  of  .a  re¬ 
cent  p.tpcr  co.iiitiiorcd  by  Kailath  [IEEE  Trans.  lT-25(19Ti'1 . 
p. 121 -124]  are  contained  in  [20,21],  and  that,  three 
months  prior  to  the  submission  of  Kail.nth's  p.nper  .nul  iic 
his  request.  [20,21]  were  personally  handed  over  to  him 
by  one  of  the  authors.) 

Together  with  (3.4),  representation  (3.8)  yields 
a  baoheard  realiaation  of  (y(t)  ;  0  *  t  s  T),  namely 

f  d*  A  -A'xdt  ♦  ids  ;  x(T)  a  j 
(S)  ‘  (3.9) 

I  dy  A  c'xdt  ♦  Ddw 

i 

whose  state  covariance  matrix  function  P  satisfies 

p  A  -A'P  -  PA  -  ii';  P(T)  A  ii,  (3.10) 

where  n  :a  P(T)*^.  By  this  procedure  each  S  t  gives 
rise  to  a  backward  realization  S;  the  class  of  all  S 


(5.2) 


Rcncr.iteii  in  this  way  will  be  denoted 


Next  we  proceed  to  enl.nrce  the  class  Given 
.nny  S  t  S*.  (by  symmetry  with  the  forward  setting)  the 
line.ir  least  squares  estimate 


x.(t)  »  n{x(t)  |ll*(dy)} 

is  generated  by  the  hackuaed  Kalman-Bucy  filter 
dx.  •  -A'x.dt  ♦  B,R'^^*(dy  -  C'x.dt)  ; 
x.(T)  .  0. 


(3.H) 


(3.12) 


where  the  gain  function  can  be  determined  via  a  ma> 
trix  Riccati  equation  [1].  Now,  in  complete  analogy 
with  the  forward  setting,  we  define  S  to  be  the  class 
of  all  analytic  backward  realizations  of  {y(t); 

0  s  t  s  T}  whose  backward  Kalmiin-Bucy  filter  is  given 
by  (.>.12).  It  can  be  shown  fl]  that  5^  c  5,  and  hence 
we  have  obtained  the  required  extension.  All  realiza¬ 
tions  of  class  5  are  miniDal . 


■  Unfortunately,  there  is  no  one-one  correspondence 
between  models  in  S  and  S.  For  this  we  need  to  en¬ 
large  these  classes  even  further.  Tliis  leads  to  gen¬ 
eralized  stochastic  realizatiotxs . 


4.  GENERALIZED  STOCHASTIC  REALIZATIONS 

In  order  to  extend  the  one-one  correspondence  be¬ 
tween  forward  and  backward  realizations  be.vond  and 
5^  we  shall  have  to  enlarge  S  and  S  slightly  in  the 
following  way.  bet  5  be  the  class  of  all  systems  (1.1) 
which  for  any  e  >  0  is  an  analytic  realization  of 
{y(t);  0  s  t  s  T  -  eJ  having  (3.2a),  restricted  to  the 
interval  (0,T-c],  as  its  Kalman-Rucy  filter.  Similarly, 
we  define  $  to  be  the  class  of  all  models  (3.9)  which 
for  any  c  »  0  is  an  analytic  realization  of  {y(t): 
e  s  t  s  T)  such  that  (3.12),  restricted  to  [c,T],  is^it.< 
backward  Kalmnn-Bucy  filter.  Tlie  elements  of  S  and  S 
will  be  called  generalized  realizations  and  generalized 
baaloiard  realizations  gf  {y(t);  0  s  t  s  T)  respectively. 
Clearly  S  «  5  and  S  = 

Then  to  each  realization  S  <  5  thgre  corresponds 
a  generalized  backward  realization  S  e  5.  In  fact,  it 
can  be  shown  that,  since  S  is  minimal,  (A,J1)  is  com¬ 
pletely  coiitrollnblc.  Tins  together  with  the  analyti- 
city  implies  that  (A,B)  is  eotollo  controllable  [40,41]. 
Consequently  P(t)  has  an  analytic  inverse  on  any  inter¬ 
val  [c.T],  for  the  la.st  term  of  (3.7)  is  the  controlla¬ 
bility  grnmian.  (Cf  [1];  Lemma  2.2).  Hence  the  proce¬ 
dure  lending  to  the  backward  model  (3.9)  can  always  be 
carried  out  on  the  restricted  interval  [c,T].  Similarly 
there  corresponds  a  generalized  realization  S  <  S  to 
any  backward  realization  S  t  S.  We  collect  these  obser¬ 
vations  in  the  following  theorem. 

THEOREM  4 . ] .  To  each  realization  (1.1)  in  S  there  cox- 
reeponas  a jgeneralized  backward  realization  (i.9)  in  $ 
riicli  that  r  «  P*',  fl  «  P"'ll,  X  ■  P*^x  and  uw  ■  dw  -  ■ 
B'P-'xdt.  hikewiae  to  each  bnch.iard  realiiiation  (Z.3) 
ill  S  l.lici’r  in  ft  yimrf.tii:'-r.ti  rfniii-iticn  (I.J)  in  !>  luic'/i 
that  P  «  P"*,  B  ■  P"*6,  X  »  and  dw  »  d8  •»  BT’"^Xdt. 


5.  THE  MimHUH-  AHO  MAXIMUM-  VARIANCE  REALIZATIONS 

It  is  well-known  that  the  innovation  proce.te 
(w,(t):  Os  t  !  T),  whose  increments  arc  defined  by 

dw.  .  R‘‘^‘fdy  -  Cx.dtj,  (5.1) 

is  a  process  with  orthogonal  increments  satisfying 
(1.21  and  li*(dw.)  •  ii*{dyj  for  all  t  x  [0,T]  (see  e.g. 
[43]).  Theii  (3.2a)  aAd  (S.i)  yield 
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[ 

|dx.  ■  Ax.dt  ♦  B.dw.  ;  x.(0)  »  0 
1/2 

|dy  •  Cx.dt  ♦  R*'  dw,, 

which  is  a  realization  in  S,  for  B.  is  clearly  analytic. 

Likewise,  the  backward  Kalman-Bucy  filter  (3.12) 
can  be  written 

[dx,  •  -A'x.dt  ♦  B,dw,;  x,  (T)  «  0 


'  *•’1  -  1/2  - 
dy  «  C'x.dt  ♦  R*'  dw,, 

where  {w,(t) ;  0  s  t  £  T)  is  the  backjard  irjtovation 
cess 


(5.3) 

piv- 


dw. 


R'^^^(dy  -  C'x.dt), 


(5.4) 


which  has  orthogonal  increments  and  satisfies  (1.2)  and 
the  condition  nr{dw«)  •  Hj(dy}  for  all  t  e  (0,T].  (See 
[20,45].)  Clearly,  S,  eS.  Now  let 


dx*  '  Ax*dt  ♦ 
dy  •  Cx*dt  ♦  R 


B*dw 

1/2 


dw* 


x*(0) 


c*. 


(5.5) 


be  the  forward  counterpart  of  S.  as  defined  by  Theorem 
4.1,  and  let  P*  be  the  corresponding  state  covariance 
function.  Since  5.  /  S*  exists  only  as  a  gcneralizcil 
realization,  and  obviously  l’*(t)  as  t  ■»  T. 


LEMMA  5.1 .  Let  P,  and  P*  be  the  state  covariance  func¬ 
tions  of  S.  and  S*  respectively  and  define  Q  :•  P*  -  P,. 
Then  the  state  covariance  function  P  of  an  arbitrsi'u 
realization  S  e  S  satisifiee 


P.(t)  S  P(t)  S  P*Ct)  (5.6) 

for  all  t  e  [0,T).  Moreover,  Q  >  0. 

Consequently,  we  shall  call  S,  the  ainir-r.tr:-  and  S* 
the  naximum-variance  realization.  By  eliminating  dw*  in 
(S.S),  it  is  immediately  seen  that  x*  satisfies  the  Kal- 
raan-Bucy  type  equation 

dx*- «  Ax*dt  ♦  B*R'*'^^(dy  -  Cx*dt)  ;  .x*(0)  •  i*  iS.Ta) 

Let  S  be  an  arbitrary  rcnlitation  of  cl.iss  -S.  Viien.  de¬ 
fining  Q*  :•  P*  -  P,  it  is  not  hard  to  see  from  ^3.4) 
and  (3.6)  that 

B*  «  -(Q*C'  -  BD')R*'^^  iS.Tb) 

with  Q*  satisfying  the  matrix  Riccati  equation 

}q*  •  AQ*  ♦  Q*A'  *  (Q*C'  -  BD')R'‘(Q*C'  -  BD')'  -  Bt  ' 

\  iS.Tc, 

|q*(0)  •  n*  -  n, 

where  n*  •  P.CO)’*.  Clearly  Q*(t)  as  t  -•  T.  Ti.e 
filter  (5.7)  is  precisely  the  mysterious  "baexuard  fil¬ 
ter"  of  the  M.nync-Prascr  two-filter  formula,  as  we  nave 
seen  .above,  it  is  actually  a  forward  real  izat  1  cn .  .x.ncr 
<}*  «  P*  -  P,  wc  can  liifi-i-i'rv't  l)*  ns  .ui  ciror  rpvari.i.ui 
function,  much  in  analogy  with  the  kalman-ducy  filter. 

In  fact, 

Q*(t)  .  E{(xCt)  -  X*(t)]lx(t)  -  x*(t);'  (S.S; 

for  ail  t  i  (O.Tj.  Tins  is  an  iim.icui.itr  con«<'i;uinci'  of 
the  following  lemma,  which  we  shall  nerd  again  in  tnr 
next  section. 

LEMMA  5.2.  Let  x  be  the  etate  proeees  a'w_^P  zue  ata:.- 
aovarianee  function  of  any  realization  in  $,  Then 

E(x(t)x.(t)’)  -  P.(t).  E(x(t)x*(t)  •)  .  P(t)  (S.9) 


and 

E{[x(t)  -  x.(t)][x*(t)  -  x(t)]'}  .  0  (S.IO) 

on  any  intarvdl  on  which  theae  quantitiea  ara  defined. 

We  shall  now  demonstrate  that  the  two  processes 
X,  and  X*  together  contain  all  the  relevant  information 
on  y  needed  in  estimating  the  state  process  x  of  an  ar¬ 
bitrary  realization  S  <  S.  To  this  end  first  note  that 

(3.1)  can  be  written 

E(H^(x)lll'(dy)}  .  H^(x,).  (S.ll) 

and  that,  since  obviously  M^(x*)  ■  ll^(x^),  (3.11)  yields 

B{llj(x)lH*(dy)}  -  ll^(x*)  (S.12) 

for  all  t  €  {0,7} .  Now  define  the  orthogonal  com)>lc- 
ments  Kj  :«  lljCdy)  e  H^(x,)  and  S*  :•  ll^(dy)  o  ll^(x*) 
respectively.  Then  we  obtain  the  orthogonal  decomposi* 
tion 

H(dy)  -  N‘  •  h“  •  N*  (S.13) 

where  11°  is  the  frame  apace 

h“  «  H^(x,)  V  (5.14) 

(where  A  v  g  denotes  the  closed  linear  hull  in  H  of  A 
and  B.)  (Cf.  [22,24,26].) 


for  all  t  t  [0,T).-  Tlicii  solving  (0.4)  and  (6.5)  for  K, 
and  K*  we  obtain  K,  ■  <)*Q'^  and  K*  •  Q.Q"^,  where  as  be¬ 
fore  Q,  •  P  -  11* ,  Q*  •  P*  -  P  and  Q  •  P*  -  P, .  Note 
that  Q(t)  is  nonsingular  for  all  t  t  [0,7}  (Lemma  S.i) 
and  that 

Q(t)  ■  Q.(t)  ♦  Q*(t).  (6.6) 

THEOREM  6.1  ■  Let  x  be  the  etate  proaeaa  of  a  realization 

(1.1)  of  alaaa  S.  Then  .the  anoothing  estinate  (6.1)  ia 
given  by 

J(t)  •  [1  -  Q.(t)QCt)‘^]x.(t)  ♦  Q.(t)Q(t)*^'c*it)  (6.7) 
and  the  error  covax‘iance  function  (6.2)  by 

E(t)  •  Q.(t)  -  Q,(t)Q(t)*^Q.(t)  (6.8) 

for  all  t  t  [0,7). 

Proof.  Relation  (6.7)  was  derived  above  for  t  <  (0,7); 
for  t  «  0,  (6.7)  follows  from  (7.6)  below.  To  prove 
(6.8)  note  that 

X  -  i  -  (I  -  Q.Q'b(x  -  X.)  *  Q.Q’^x  -  X*).  (6.9) 

By  Lemma  5.2  the  two  terns  of  (6.9)  are  orthogonal  and 
therefore,  observing  (3.2c)  and  (5.8), 

I  ■  (I  -  Q.Q‘^)Q.(I  -  Q*’q.)  ♦  Q.Q‘^Q*Q'‘Q.. 


LEWA  5.3.  (cf.  [27])  Let  x  be  the  atate  proceaa  of  a 
realization  in  S.  Then,  for  t  e  (0,7), 

llj(x)  e  [H(dy)J-^ 

where  [ll(dy)]'''  ia  the  orthogonal  con^lement  o/ H(dy) 
in  il. 

Proof.  Clearly  Hj(x)  i  N^.  To  see  this  note  that  the 
components  of  x(tj  -  x*(t)  arc  orthogonal  to  Hj(dy}  m 
Nf  and  that  the  components  of  x*(t)  belong  to  H^(x*)  i 
N*.  In  the  same  way  we  show  that  Hj(x)  i  N^.  G 

6.  THE  SMOOTHING  PROBLEM 


Consider  an  arbitrary  realization  (1.1)  in  the 
class  3.  The  basic  problem  before  us  is  to  determine 
the  smoothing  estimate 

x(t)  -  E{x(t)lH(dy)}  (6.1) 

for  each  t  c  [0,7)  and  to  interpret  it  in  terms  of  sto¬ 
chastic  realizations.  Let  Z  denote  the  corresponding 
estimation  error  covariance,  i.e., 

I(t)  ■  E{[x(t)  -  x(t)][x(t)  -  x(t)]').  (6.2) 

In  view  of  Lemma  5.3,  x(t)  c  II?,  and  consequently 
there  are  two  matrix  functions  K*  and  K*  such  that 

x(t)  -  K.(t)x.(t)  ♦  K*(t)x*(t).  (6.3) 

TTte  components  of  the  estimation  error  x(t)  -  S(t)  are 
clearly  orthogonal  to  ll(dy) ,  and  honce,  in  particular, 
to  the  components  of  x,(t)  and  x*(t).  Tliorefore, 
E{x(t)x.(t)')  -  C{x(t)x.(t) '}  and  E{x(t)x*(t) • 
E{x(t)x*(t) ').  By  Lemma  5.2,  the  first  of  these  rela¬ 
tions  yields  P,  ■  K.P,  ♦  K*P,  and  consequently 

K,(t)  *  K*(t)  •  I  (6.4) 

for  all  t  (  (0,7).  because  P*(t)  is  nonsingulnr  on  this 
Interval.  The  second  relation  yields 

P(t)  •  K.(t)P,(t)  •  IC*(t)l**(t)  (6.5) 


which,  in  view  of  (6.6),  yields  (6.8).  D 
7.  THE  MAYNE-FRASER  SNOOTHINS  FORMULA 


We  shall  now  restrict  our  .nttention  to  re.-ilizntions 
for  which  both  Q,  and  Q*  arc  invertible  for  .irbitrary 
t  €  [0,7).  Since  Q«  •  P  -  P,  and  Q*  »  P*  -  P,  this  is 
possible  for  all  S  e  S  sucli  that  P,(t)  <  P(t)  <  P*(l)  for 
all  t  on  this  interval.  IVe  shall  call  the  class  of  all 
such  S  the  interior  of  S  and  denote  it  int  S.  It  can  be 
shown  that  intS  is  indeed  nonempty  [1;  Lenuna  3.6] . 

THEOREM  7.1,  Let  S  e  int  S,  let  x  be  the  atate  process 
of  S,  and  let  x  be  the  corresponding  smoothing  estirate 


(6.1).  Then,  for  each  t  e  [0,T) 

x(t)  .  E(t}[Q.(t)’^x,(t)  ♦  Q*(t)*‘x*(t)],  (7.1) 

where  x,  and  x*  are  given  by  (3.2)  and  (5.7)  respectively 
and  the  emoothing  error  covariance  Z  by 

■  E(t)_’^  •  Q.(t)'^  ♦  Q*(t)'^  (7.2) 

Proof.  Since  S  t  int  3,  Q,  and  Q*  are  invertible.  Pv 
writing  (6.8)  as  Z  •  Q*Q‘1(Q  -  Q.)  and  using  (6.6),  it 
is  seen  that 

E-Q,Q‘^Q*-  (7-5) 


Inverting  this  and  ag.-iin  us’ng  (6.6)_yields  (7.2).  Frem 
(7.3)  wc  also  see  that  Q,Q'^  •  Z(Q*)  '.  Tiien  I  -  Q.v)"*  • 
E[Z**  -  (Q*)'M  •  EQ;*.  Hence  (7.1)  follows  from  (6.7)  .C 


Relations  (7.1)  and  (7.2)  together  with  (3.2)  and 
(5.7)  is  the  Hoiine-Fraaer  two~ filter  fcmula  [5,6], 
which  has  received  considerable  attention  in  the  litera¬ 
ture  [7-9,13-17].  Altnough  this  algorithm  is  easy  to 
derive  formally  [9,12,13],  its  probabilistic  .lustifica- 
tion  has  caused  considerable  difficulty,  partly  due  to 
the  f.ict  that  Q'i't}  as  t  ••  T.  Tne  system  (5.7)  has 
usually  been  interpreted  as  a  backward  filter,  and  in 
[14,17]  It  is  presented  as  the  limit  of  such  a  filter  ns 
a  certain  covariunce  matrix  function  tends  to  infinity. 
However,  in  our  stochastic  realization  setting  (5.7)  has 
n  very  natural  interpretation:  It  is  simply  the  ma.xiimim- 
variance  foivard  realization  S*.-  By  using  the  identity 
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•'V.. 


X*(t)  •  P,(t)'*x,(t)  (7.4) 

we  can  instead  vnrite  the  smoothins  formula  (7.1)  in 
terms  of  two  Kalman-Bucy  filters,  one  (3.2)  evolving 
forward  and  the  other  (3.12)  evolving  backward  in  time. 
(Note  that  then  (7.1)  is  defined  on  the  whole  interval 
|o,T].)  This  fact  was  pointed  out  in  [14,15,17],  in 
which  papers  the  backward  estimate 

1i^(t)  -  E{x(t)|ll*(dy))  (7.5) 

was  used  in  place  of  X,.  a  choice  that  may  at  first 
sight  seem  more  natural.  The  reader  should  however  note 
that 

\(t)  ■  P(t)P*(t)-lx*(t)  (7.6) 

is  not  invariant  over  5  and  is  therefore  less  suitable 
for  our  purposes.  It  is  not  hard. to  see  that 

•  [(Q*)'^  +  P'^]P(P*)‘^  (7.7) 

and  consequently  (7.1)  may  also  be  written 

x(t)  -  £(t){Q.(t)‘^x,(t) 

♦  [Q*(t)‘^  ♦  P(t)'hxjj(t)},  (7.8) 

which  is  the  formula  presented  in  [14,15,17].  In  the 
early  papers  [7,8],  relation  (7.1)  was  introduced  via 
a  formula  [47]  for  optimal  weighting  of  two  estimates 
with  orthogonal  errors.  No  justification  of  this  or¬ 
thogonality  was  given  in  [8],  and  the  argument  in  [7]  is 
incomplete  due  to  problems  with  the  end  point  condition. 
(Cf.  [48].)  However,  the  stochastic  realization  theory 
provides  a  natural  justification  of  this  procedure.  In¬ 
deed,  (5.10)  is  the  required  orthogonally  condition. 

8.  INTERNAL  AND  EXTERNAL  REALIZATIONS 

Since  R  ;>  DO'  is  assumed  to  be  full  rank,  the 
dimension  of  w  is  always  greater  than  or  equal  to  the 
dimension  of  y,  i.e.,  ms  p.  iVe  shall  now  consider  re¬ 
alizations  for  which  m  >  p.  Then  0  is  invertible,  and 
w  can  be  eliminated  from  (1.1)  to  yield  the  Kslman-Bucy 
type  equation 

dx  •  Axdt  ♦  B0'^(dy  -  Cxdt) ;  x(0)  •  5.  (8.1) 

If,  in  addition,  we  assume  that  C  c  H(dy) ,  it  is  imme¬ 
diately  clear  that 

ll(x)  e  ll(dy).  (8.2) 

We  shall  call  any  generalized  realization  of  (y(t); 

0  s  t  s  T]  for  which. (8. 2)  holds  an  internal  realiza¬ 
tion;  all  other  S  <  S  will  be  named  external  [20].  Ob¬ 
viously,  the  internal  realizations  are  precisely  those 
for  wiiich  the  smoothing  problem  is  trivial,  the  esti¬ 
mate  being  exact.  In  particular,  and  S*  are  inter¬ 
nal. 

Let  So  be  the  class  of  all  S  c  S  such  that  p  ■  m 
and  C  «  H(d^).  Then  we  have  just  shown  that  all  reali¬ 
zations  in  Sq  are  internal.  Tlie  following  theorem,  the 
proof  of  which  is  given  in  [1],  states  that,  under  some 
adld  regularity  conditions,  the  converse  is  also  true. 

.  (a) 

THEOREM  8.1 .  4  rxatiaation  S  c  S  xuah  that  Itae  full 
rank  it  inttmal  if  and  only  if  S  c  Sq- 

t 

In  view  of  Tlicoren  6.1,  tne  state  proces.s  x  of 
any  realization  S  of  class  S  can  be  written 

z(t)  •  [I  -  a(t)]x,(t)  ♦  ii(t)x«(t)  ♦  x(t),  (8.3) 

whore  n  :■  Q«Q*^  ond  f  :■  x  •  S.  The  smoothing  error  x 


is  identically  zero  if  and  only  if  5  is  internal.  To  ob¬ 
tain  a  complete  characterization  of  the  external  re.ili- 
zations  in  S-,  we  shall  provide  a  representation  for  x 
also.  To  this,  end,  note  that,  given  a  realization  (1.1), 
tliere  exists  an  orthogonal  p  x  p-inatrix  V(t)  for  each 
t  t  [0,T]  such  that 


(t)j  pj(t) 

(tj 


B,(t), 


where  Bj  is  n  w  ni  and  B^  is  n  »  (p  .  m) ,  and  that 

fdJI 

j  »  V  dw 
[dv 


(S.4a) 


(8.4b) 


defines  a  pair  of  orthogonal  increment  processes  u  and  v, 
of  dimensions  m  and  p  -  m  respectively.  Obviously  (8.46) 
satisfies  (1.2). 

THEOREM  8.2.  Let  x  be  the  state  process  of  c  realizs- 
tion  S  €  and  let  Bj  and  v  be  defined  by  (S.4j.  7'nen 
the  smoothing  error  x  is  given  by 


3;(t)  •  Q.(t)n(t) 
dn  =  -rjndt  *  Ql^B,  d;;  ri(T) 


|dn  =  -rjndt  *  Q."B,  d;;  ri(T)  .  (S.Sb) 

where  P.  ■  A  -  B.R'^/^C,  *  Qr^(T)[xCT)  -  x,(7)]  c,’ui 

I,  is  a  Ip  -  in)  ‘dimensional  crthogorial  increment  prccess 
of  type  (7.2)  such  tfiat  H(dL)  1  ll(dy) .  Horeover, 
n-j-  X  H(dO.  i-e-t  (S.Sa)  is  a  backward  Markovian  repre¬ 
sentation  f3.SaJj  and  the  increments  of  (  tu'e  given  by 

dt  »  dv  -  BjQ^^ix  -  x,)dt.  (8.6) 

Together  with  (8.5),  (8.3)  constitutes  a  generali¬ 
zation  of  the  stationary  internal  state  representation 
in  Theorem  5.5  of  [20].  For  external  realizations,  how- 
ever,  n  is  not  a  projection.  In  fact,  li  is  a  projection 
if  and  only  if  S  is  internal .  To  see  this  observe  that 
Jr  “  n,  i.e.,  Q,Q**Q,  •  Q»,  if  and  only  if  I  •  0 
(Tlieorem  6.1). 

For  internal  realizations  we  have  the  following 
stronger  result,  which  illustrates  the  important  role 
played  by  the  feedback  matrix  F,  defined  in  Theorem  S.2. 
It  is  a  generalization  of  a  result  found  in  (72;  pp.75- 
79]. 

THEOREM  8.3.  Let  Y  be  the  transition  function  of 

I  •  > 

•^Y(t,s)  -  r.(t)Y(t,s);  Y(s,s)  -  I  (8.7) 

Then  xyis  a  state  process  of  an  intertial  realisation  of 
class  S  if  and  only  if  there  is  a  family  {.'1^;  t  t  (0,T)} 
of  subspaoes  of  R",  satisfying  the  condition 

V(t,s)Mj  c  for  all  s  s  t,  (8.S) 

such  that,  for  each  X.  t  [0,T), 

x(t)  •  [I  -  n(t)]x.(t)  ♦  n(t)x‘(t),  (8.9) 

where  II(t)  is  a  projection  onto  H*  along  Q(t)Mj,  Q(t) 
being  the  covariance  matrix  cf  x*Ct)  -  x.Ct).  Then  il 
is  given  by 


(8.10) 


where  Q.  is  defined  by  (3.Zc). 


Proof.  Ke  shall  use  the  tame  idea  of  proof  ns  in  [22]. 
(only  if):  Let  x  be  .the  state  process  of  an  internal  re¬ 
alization  of  class  S.  Then,  by  Theorem  6.1,  x  satisfies 
(B.9)  with  n  given  by  (8.10).  Ke  just  proved  that  R  is 


S 


a  projection  (onto  some  subspace  >  and  the  fact  that 
n(t]Q(t)  is  symmetric  implies  that  i'((t)  projects  along 
Q(t>if.  It  just  remains  to  show  that  the  family  (M^; 
t<  [O.T)}  of  subspaces  satisfies  (8.8j,  or,  which  is 
equivalent, 

n(t)'P(t,s)n(sD  »  ¥(t,s)n(s)  for  all  t  k  s.  (8.11) 
To  this  end,  first  note  that  (8.9)  can  be  written 

2.  -nr  (8.12) 

where  z,  :•  x  -  x«  and  z  :«  x*  -  x, .  rrom  the  differ¬ 
ential  equations  for  z.  .and  z  it  is^immcdi.ately  seen 
that  n{z(t)lz(s))  •  V(t,s)z(s)  and  E{ z.(t)  I  z,(s))  - 
T(t,s)z,(s).  Projecting  the  first  of  these  relations 
over  lljCz.)  and  premultiplying  by  n(t),  one  obtains 

E{z,(t)lz.(s)}  •  II(t)V{t.s)n(s)zCs).  (8.13) 

for,  in  view  of  (8.12),  Hjfz*)  c  115(2)  and,  by  Lemma  5.2 
and  the  usual  projection  formula  [1;  Lemma  2.1], 

E{z(s)i z,(s))  »  Q.(s)Q(s)‘^2(s).  Then  comparing  (8.13) 
with  the  second  of  the  formulas  in  the  text  above  (8.13), 
(8.11)  follows  by  noting  (8.12)  and  the  fact  that  Q(s)  « 
E(z(s)z(s)'}  >  0  (Lemma  S.l).  (if):  Let  {II(t); 

t  t  [0,T])  be  a  family  of  projections  satisfying  the 
conditions  of  the  theorem.  Since  Q  :■  E{z(t)2(t)')  and 
Q,  :•  E(z»(t)2*(t) '),  it  is  immediately  seen  that  it  is 
given  by  (8.10).  In  fact,  by  (8.12),  Q*  '  ilQIl '  «  n^Q  = 
nQ,  for,  since  n(t)  projects  along  Q(t).'t^, 

on'  «  no-  (8.14) 

tereover,  since 

HjCz*)  c  H^(z)  1  H;(dy)  (8.15) 

[see  (8.12)],  (3.1)  holds.  Hence,  it  only  remains  to 
prove  that  x  is  actually  the  state  process  of  a  reali¬ 
zation  S  with  the  prescribed  values  of  A,  C  and  R;  then 
B,  will  have  the  right  value  also,  and,  in  view  of 
(8.9),  S  must  be  internal.  To  this  end,  first  note 
that,  in  view  of  (5.2)  and  (5.5),  z  satisfies  the  dif¬ 
ferential  equation  ' 

dz  -  r.zdt  -  QC’R'^^^dW;  i(0)  •  (8.16) 

We  need  to  prove  that 

E{z(t)IH'(z.))  -  1'(t.s)z.(s)  for  t  k  s.  (8.17) 

But  (8.17)  is  equivalent  to  E{z(t)z,(T) ’)  ■ 
'K(t,s)E(z.(s)z,(T) '}  for  all  t  s  s,  and,  by  (8.12)  and 
(8.16),  this  is  the  same  as 

’r(t,T)Q(T)n(T) '  -  r(t,s)ii(s)'i'(s,T)Q(T)n(T) (8.18) 

which  is  an  immediate  consequence  of  (8.11)  and  (8.14). 
Then,  prcmultiplying  (8.17)  by  n(t)  and  using  (8.11) 
and  (8.12),  we  have 

2lz.(t)IH‘(z.))  -  'r(t,s)z.(s)  for  t  k  s.  (8.19) 
Now,  inserting 

dw,  -  R'^^^Czdt  «  dw»  (8.20) 

into  the  state  equation  of  S.  we  see  that 
i(x.(t)lll*{zj}  -  «(t,s)E{x,(s)lir(z.)) 

*  ^%(t,T)B.fl)R‘*^^T)C(T)y(T,S)dTZ,(s).  (8.21) 

Is 

where  we  have  used  (8.17)  to  obtain  the  last  term. 

Adding  (8.19)  and  (8.21)  yields 


n{x(t)illj(:.))  *  «(t,s)Ei  x(s)iiij(z.))  for  t  k  s,  (8.22) 

for  the  last  terra  of  (8.21)  can  be  written 
U-(t,s)  -  ¥(t,s)]r,(s)  [39;  p.ll7].  Ibreover. 

£tx(t)|H‘(dy))  »  (Ji(t,s)E{x(s),H*(dy)}  for  t  k  s.  (8.25) 

To  see  this,  replace  x  by  x,  ♦  i,  and  reraeraber  that 
••t(s.)  =  lltC^J  N'ow,  (8.22)  and  (5.23)  together 

yield 

E{x(t)lH’)  *  ♦(t,s)x(s)  for  t  k  s,  (.''.24) 

where  llj  :»  ll^(dy)  v  ll5(2»).  Nqxt  define  a  process  ii 
with  increments  du  «  dw,  -  R"^' ^Cf.d't,  which  c.nn  be  seen 
to  be  an  orthogonal  increment  process  of  type  (1.2)  such 
that  H(du)  c  1C.  It  is  not  hard  to  see  that  iiji.du)  lii^. 
In  fact,  in  view  of  (8.20),  du  *  R“^'^C(z  -  z,)cit  ♦  dw*, 
and,  since  H^(z,)  c  H^(r)  j.  H^(dw*)  [see  (8.16)],  (8, IT) 
and  (8.19)  imply  that  H{(du)  i  lC(z,);  also,  because  of 
(8.15),  H*(du)  1  H^(dy)  holds  trivially.  But 

dy  =  Cxdt  -  R^'^^du,  (8.25) 

and  therefore 

E{y(t)|H‘)  «  [pC(T)4>(T,s)dT]x(s)  (8.26) 

J  s 

follows  from  (8.24)  and  the  fact  that  lij(du)  i  Hj.  Now. 
since  iis(dy)  v  iljix)  c  II*,  (S.24)  and  (S.26)  iinp'iy  that 
(x',y')'  is  a  nVrkov  process,  .ind  consequently  it  h:is  a 
representation  (1.1).  Relations  (5.24)  -  (5.26)  insure 
that  A,  C  and  R  have  the  required  values.  0 

Finally,  to  further  stress  the  importance  of  the 
feedback  matrix  r.>  let  us  point  out  that  (8.3)  can  be 
written 

x(t)  •  x.(t)  ♦  Q.(t)Q(t)’^(t)  ♦  Q.(t)ri(t)  (8.27) 
where  x,  satisfies 

dx«  •  r.x.dt  ♦  B,R  ^^^dy;  x,(0)  »  0,  (8.28) 

and  z  and  r,  are  given  by  (8.16)  and  (8.Sb). 
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